Reinforcement learning

Results: 1147



#Item
131Applied mathematics / Artificial intelligence / XT / Learning / Reinforcement learning / Artificial neural network / Machine learning / Supervised learning

Temporal Difference Learning to Detect Unsafe System States Huazhong Ning∗ , Wei Xu† , Yue Zhou∗ , Yihong Gong† , Thomas Huang∗ ∗ ECE Department, U. of Illinois at Urbana-Champaign, Urbana, IL 61801. {hning2,

Add to Reading List

Source URL: www.ifp.illinois.edu

Language: English - Date: 2008-08-06 16:36:02
132Robot control / Humanoid robot / Mobile robot / Robotics / Zero moment point / Bipedalism / Walking / Reinforcement learning / Artificial intelligence / Biota / Learning

German Journal on Artificial Intelligence (KI), Springer, to appearNoname manuscript No. (will be inserted by the editor) Online Learning of Bipedal Walking Stabilization

Add to Reading List

Source URL: www.ais.uni-bonn.de

Language: English - Date: 2015-06-26 14:32:15
133Statistics / Statistical theory / Probability / Bayesian statistics / Dynamic programming / Markov processes / Stochastic control / Markov decision process / Reinforcement learning / Q-learning / Prior probability / Conjugate prior

Competing with Humans at Fantasy Football: Team Formation in Large Partially-Observable Domains

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2012-04-19 16:26:14
134Cognitive science / Artificial intelligence / Cognition / Philosophy / Game theory / Computational trust / Computer access control / Key management / Software agent / Reinforcement learning / Practical reason / Trust

Sequential Decision Making with Untrustworthy Service Providers W. T. Luke Teacy, Georgios Chalkiadakis, Alex Rogers and Nicholas R. Jennings Electronics and Computer Science,University of Southampton Southampton, SO17 1

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2008-02-08 15:13:13
135Machine learning / Artificial intelligence / Learning / Applied mathematics / Computational neuroscience / Dynamic programming / Stochastic control / Reinforcement learning / Apprenticeship learning / Markov decision process / Artificial neural network / Supervised learning

Around Inverse Reinforcement Learning and Score-based Classification Matthieu Geist IMS - MaLIS Research Group (Supélec) Metz, France

Add to Reading List

Source URL: www.metz.supelec.fr

Language: English - Date: 2014-01-18 03:53:53
136Mathematics / Mathematical optimization / Dynamic programming / Mathematical analysis / Equations / Operations research / Systems theory / Stochastic control / Bellman equation / Markov decision process / Q-learning / Reinforcement learning

Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare and Georg Ostrovski and Arthur Guez Philip S. Thomas∗ and R´emi Munos Google DeepMind {bellemare,ostrovski,aguez,munos}@google.com;

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2015-12-12 00:05:18
137Cognitive science / Cognition / Artificial intelligence / Machine learning / Belief revision / Reinforcement learning / Temporal difference learning / Q-learning / Feature selection / Supervised learning / Proto-value functions / Action selection

Evolutionary Feature Evaluation for Online Reinforcement Learning

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
138Belief revision / Reinforcement learning / Caregiver / Robotics / Human development / Psychoanalysis / Learning / Personal life / Behavior

Enhancing Agent Safety through Autonomous Environment Adaptation Benjamin Rosman Bradley Hayes

Add to Reading List

Source URL: bradhayes.info

Language: English - Date: 2016-07-11 15:51:27
139Numerical analysis / Applied mathematics / Statistics / Markov models / Mathematical optimization / Operations research / Reinforcement learning / Expectationmaximization algorithm / Sine / Proximal gradient method / Gradient method / Loss function

Trust Region Policy Optimization arXiv:1502.05477v4 [cs.LG] 6 Jun 2016 John Schulman JOSCHU @ EECS . BERKELEY. EDU

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2016-06-06 20:48:19
140Markov models / Behaviorism / Markov processes / Psychology / Addiction / Behavior therapy / Reinforcement / Markov chain / Behavior / Learning

Advances in Theoretical Economics Volume , Issue   Article 

Add to Reading List

Source URL: people.bu.edu

Language: English - Date: 2006-03-02 12:30:18
UPDATE